Image Categorization Using Hierarchical Spatial Matching Kernel

Authors

  • Tam T. LE
  • Yousun KANG
  • Akihiro SUGIMOTO
Abstract

Spatial pyramid matching (SPM) has been an important approach to image categorization. The method partitions the image into increasingly fine sub-regions and computes a histogram of local features in each sub-region. Although SPM is an efficient extension of the unordered bag-of-features image representation, it still measures the similarity between sub-regions with the bag-of-features model, which limits its capacity to achieve optimal matching between sets of unordered features. To overcome this limitation, we propose a hierarchical spatial matching kernel (HSMK) that uses a coarse-to-fine model for the sub-regions to obtain better approximations of the optimal matching. The proposed kernel deals robustly with unordered feature sets as well as with sets of varying cardinality. In experiments, HSMK outperformed SPM and achieved state-of-the-art performance on several well-known image categorization benchmark databases, even though we use only a single type of image feature.
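The SPM baseline described in the abstract (partition the image into increasingly fine sub-regions, then build a histogram of local features per sub-region) can be sketched as follows. This is a minimal illustration of standard SPM only, not of the proposed HSMK, whose details the abstract does not give. The function names `spm_histograms` and `spm_kernel` are hypothetical, features are assumed to be already quantized into visual-word indices with positions normalized to [0, 1), and the level weights follow the commonly used SPM formulation, which is an assumption here rather than something stated above.

```python
import numpy as np

def spm_histograms(xy, words, vocab_size, levels=2):
    """Per-cell visual-word histograms for pyramid levels 0..levels.

    xy:    (n, 2) feature positions, assumed normalized to [0, 1).
    words: (n,) integer visual-word indices in [0, vocab_size).
    Returns a list with one (cells*cells, vocab_size) array per level.
    """
    xy = np.asarray(xy, dtype=float)
    words = np.asarray(words, dtype=int)
    per_level = []
    for level in range(levels + 1):
        cells = 2 ** level  # grid is cells x cells at this level
        # Grid cell of each feature at this resolution (clamped to the grid).
        col = np.minimum((xy[:, 0] * cells).astype(int), cells - 1)
        row = np.minimum((xy[:, 1] * cells).astype(int), cells - 1)
        cell = row * cells + col
        hist = np.zeros((cells * cells, vocab_size))
        # Unbuffered accumulation so repeated (cell, word) pairs all count.
        np.add.at(hist, (cell, words), 1)
        per_level.append(hist)
    return per_level

def spm_kernel(hists_a, hists_b):
    """Weighted sum of per-level histogram intersections.

    Weights are an assumption (common SPM choice): 1/2^L for level 0
    and 1/2^(L - l + 1) for level l >= 1, where L is the finest level.
    """
    finest = len(hists_a) - 1
    total = 0.0
    for level, (ha, hb) in enumerate(zip(hists_a, hists_b)):
        if level == 0:
            weight = 1.0 / 2 ** finest
        else:
            weight = 1.0 / 2 ** (finest - level + 1)
        total += weight * np.minimum(ha, hb).sum()
    return total
```

Because each sub-region is still compared through a plain histogram intersection, the sketch makes the limitation named in the abstract visible: within a cell, features are matched only by visual-word counts.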


Similar resources

Recognizing in the depth: Selective 3D Spatial Pyramid Matching Kernel for object and scene categorization

This paper proposes a novel approach to recognize object and scene categories in depth images. We introduce a Bag of Words (BoW) representation in 3D, the Selective 3D Spatial Pyramid Matching Kernel (3DSPMK). It starts by quantizing 3D local descriptors, computed from point clouds, to build a vocabulary of 3D visual words. This codebook is used to build the 3DSPMK, which starts partitioning a wor...

Spatial Fisher Vectors for Image Categorization

We introduce an extension of bag-of-words image representations to encode spatial layout. Using the Fisher kernel framework we derive a representation that encodes the spatial mean and the variance of image regions associated with visual words. We extend this representation by using a Gaussian mixture model to encode spatial layout, and show that this model is related to a soft-assign version o...

Static Image Classification based on ScSPM and LBP histogram Fourier (LBP-HF) Features

In the recent digital age, support vector machines (SVMs) that use a spatial pyramid matching (SPM) kernel have been used around the globe for image classification. Although this approach is popular, there exist many problems in its use; for example, nonlinear SVMs have a high complexity in training and testing. Applying the algorithms to big datasets, which hold many images, greater than a thousand is a ...

A Multi-Scale Learning Framework for Visual Categorization

Spatial pyramid matching has recently become a promising technique for image classification. Despite its success and popularity, no prior work has tackled the problem of learning the optimal spatial pyramid representation for the given image data and the associated object category. We propose a Multiple Scale Learning (MSL) framework to learn the best weights for each scale in the pyramid. Our ...

Analysis of the Motion of Ocean Currents in Sea Surface Thermal Images

Oceanographic images obtained from environmental satellites by a wide range of sensors allow characterizing natural phenomena through different physical measurements. For instance Sea Surface Temperature (SST) images, altimetry data and ocean color data can be used for characterizing currents and vortex structures in the ocean. The purpose of this thesis is to derive a relatively complete frame...


Publication date: 2013